Semi-supervised Learning of Utterances Using Hidden Vector State Language Model
نویسندگان
چکیده
Spoken dialogue system has an uncertain parameter during the speech recognition which controls its performance that vary for the different users as well as for the same user during multiple repetitions of even the same dialogue. This paper discusses how recognition errors in the users utterances can be handled by making use of semi-supervised learning techniques over the hidden vector state (HVS) model. The HVS Model is an extension of basic Markov model in which the context is encoded in each state as a vector. The state transitions in the HVS are factored into a stack shift operation similar to the push-down automaton. HVS-Model being a statistical model requires lot of labeled training data which is practically difficult. In this paper we present how classification and expectation-maximization semi-supervised learning approaches can be trained on both labeled and unlabelled corpora for handling the uncertainty by the user as well as the recognition errors by speech recognition system. The experimental results show that the proposed framework using the HVS model can improve the performance of the dialogue management of the spoken dialogue system when compared with the baseline model.
منابع مشابه
Semi-Supervised Transductive Speaker Identification
We present an application of transductive semi-supervised learning to the problem of speaker identification. Formulating this problem as one of transduction is the most natural choice in some scenarios, such as when annotating archived speech data. Experiments with the CHAINS corpus show that, using the basic MFCC-encoding of recorded utterances, a well known simple semi-supervised algorithm, l...
متن کاملCombining active and semi-supervised learning for spoken language understanding
In this paper, we describe active and semi-supervised learning methods for reducing the labeling effort for spoken language understanding. In a goal-oriented call routing system, understanding the intent of the user can be framed as a classification problem. State of the art statistical classification systems are trained using a large number of human-labeled utterances, preparation of which is ...
متن کاملSemi-supervised learning of the hidden vector state model for extracting protein-protein interactions
OBJECTIVE The hidden vector state (HVS) model is an extension of the basic discrete Markov model in which context is encoded as a stack-oriented state vector. It has been applied successfully for protein-protein interactions extraction. However, the HVS model, being a statistically based approach, requires large-scale annotated corpora in order to reliably estimate model parameters. This is nor...
متن کاملSemi-supervised Learning for Spoken Language Understanding Using Semantic Role Labeling
In a goal-oriented spoken dialog system, the major aim of language understanding is to classify utterances into one or more of the pre-defined intents and extract the associated named entities. Typically, the intents are designed by a human expert according to the application domain. Furthermore, these systems are trained using large amounts of data manually labeled using an already prepared la...
متن کاملActive learning and semi-supervised learning for speech recognition: A unified framework using the global entropy reduction maximization criterion
We propose a unified global entropy reduction maximization (GERM) framework for active learning and semi-supervised learning for speech recognition. Active learning aims to select a limited subset of utterances for transcribing from a large amount of un-transcribed utterances, while semi-supervised learning addresses the problem of selecting right transcriptions for un-transcribed utterances, s...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2012